An application of Shoenfield's absoluteness theorem 
to the theory of uniform distribution 



Martin Goldstern 



October 1992/ January 1993 



ABSTRACT. Theorem: If $C$ is a Polish probability space, $W \subseteq {}^\omega\omega \times C$ a Borel set whose sections $W_x$ ($x \in {}^\omega\omega$) have measure one and are decreasing ($x \le x' \Rightarrow W_x \supseteq W_{x'}$), then the set $\bigcap_x W_x$ has measure one.
We give two proofs of this theorem: one in the language of 
set theory, the other in the language of probability theory, and 
we apply the theorem to a question on completely uniformly 
distributed sequences. 



Introduction. 

The law of large numbers implies that almost every infinite sequence in $\{0,1\}$ is uniformly distributed, i.e., for almost all $t \in {}^\omega 2$ we have $\lim_{n\to\infty} D(t,n) = 0$, where $2 = \{0,1\}$, $\omega = \{0,1,2,\ldots\}$, the "discrepancy" $D$ is defined by

$$D(t,n) = 2 \cdot \left| \frac{|\{i<n : t(i)=0\}|}{n} - \frac{1}{2} \right|,$$

and "almost all" refers to the product measure on ${}^\omega 2$ induced by the equidistributed measure on $\{0,1\}$, i.e., $\mu(\{0\}) = \mu(\{1\}) = \frac{1}{2}$.

A similar argument shows that for all $k$, almost all sequences are $k$-uniformly distributed, i.e., for all $k$, for almost all $t \in {}^\omega 2$ we have $\lim_{n\to\infty} D_k(t,n) = 0$, where

$$D_k(t,n) := 2^k \cdot \max_{w \in {}^k 2} \left| \frac{|\{i<n : (t(i),\ldots,t(i+k-1)) = w\}|}{n} - \frac{1}{2^k} \right|.$$



Since the measure is countably additive, we can switch the quantifiers "for all $k$" and "for almost all $t$" and get:

for almost all $t \in {}^\omega 2$, for all $k$, $t$ is $k$-uniformly distributed.
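
The discrepancy $D_k$ can be evaluated directly on a finite prefix of a sequence. The following is a minimal sketch, assuming the sequence is given as a Python list of 0s and 1s; the function name `discrepancy` is chosen here for illustration and does not come from the paper.

```python
from itertools import product

def discrepancy(t, n, k=1):
    """D_k(t, n) = 2^k * max_w | #{i < n : t[i:i+k] == w} / n - 2^-k |.

    t is a list of 0s and 1s; the windows t[i], ..., t[i+k-1] for i < n
    must all lie inside the list, so len(t) >= n + k - 1 is required.
    """
    if len(t) < n + k - 1:
        raise ValueError("need at least n + k - 1 terms of t")
    # Count how often each word w in {0,1}^k occurs as a window.
    counts = {w: 0 for w in product((0, 1), repeat=k)}
    for i in range(n):
        counts[tuple(t[i:i + k])] += 1
    return 2**k * max(abs(c / n - 2**-k) for c in counts.values())

# The alternating sequence 0,1,0,1,... balances single digits,
# but the words 00 and 11 never occur in it.
alternating = [0, 1] * 50
d1 = discrepancy(alternating, 100, k=1)
d2 = discrepancy(alternating, 99, k=2)
```

For the alternating sequence, `d1` vanishes while `d2` stays above 1, so this sequence is 1-uniformly distributed without being 2-uniformly distributed.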

If we replace the constant $k$ by a function $s \in {}^\omega\omega$ we are led to the following concept: A sequence $t \in {}^\omega 2$ is called $s$-uniformly distributed if $\lim_{n\to\infty} D_{s(n)}(t,n) = 0$.
Let $R_s$ be the set of $s$-uniformly distributed sequences. It is easy to see that $s$ must not grow faster than $\log n$ (in this paper, log always means the logarithm to the base 2) if the set $R_s$ should be nonempty. Flajolet-Kirschenhofer-Tichy and Grill determined the exact bound for $s$ below which still almost all sequences are $s$-uniformly distributed:

1. Theorem ([2], [4]): $\mu(R_s) = 1$ iff $\varphi_s \to \infty$, where $\varphi_s(n) = \lfloor \log n - \log\log n - s(n) \rfloor$.
See also [3] for a generalization of this result. 



Now let

$$R := \bigcap_s R_s,$$

where $s$ ranges over all functions with $\varphi_s \to \infty$. Since $R$ is an uncountable intersection of measure one sets, it is a priori not clear if $R$ has measure one, or indeed if $R$ can be nonempty.

We will show below that $R$ must have measure 1. That is, as in the remark about $k$-uniformly distributed sequences, we can again switch quantifiers and get

2. Theorem: For almost all $t \in {}^\omega 2$, for all $s$ for which $\varphi_s \to \infty$ we have: $t$ is $s$-uniformly distributed.

We will give two proofs of this theorem: One for logicians, using the technique of forcing and
Shoenfield's absoluteness theorem for $\Sigma^1_2$-sets, and the second for those mathematicians
who are more familiar with the language of probability theory. Here we use von Neumann's 
theorem that Borel sets can be uniformized by measurable functions. 

3. Notation: $\omega$ is the set of natural numbers. $2 = \{0,1\}$.
${}^\omega\omega$ is the Baire space (= all functions from $\omega$ to $\omega$) with the usual metric and topology. The variable $x$ always ranges over elements of ${}^\omega\omega$. This space is naturally ordered by

$$x_1 \le x_2 \iff \forall n \; x_1(n) \le x_2(n).$$

$\omega{\nearrow}\omega$ is the set of sequences which diverge to infinity, a subspace of ${}^\omega\omega$. The variable $y$ always ranges over elements of $\omega{\nearrow}\omega$.

Let $C$ be any perfect Polish space (i.e., complete metric separable) with a complete probability measure $\mu$ (i.e., $\mu$ is $\sigma$-additive, all Borel sets are measurable, and all subsets of measure zero sets are measurable). The reader may think of $C = {}^\omega 2$, the set of all functions from $\omega$ to 2. The variable $t$ always ranges over $C$.

Analytic sets (also called $\Sigma^1_1$ sets) are those subsets of $C$ which can be obtained as projections of Borel sets in $C \times {}^\omega\omega$ (or equivalently, are a continuous image of ${}^\omega\omega$). A subset of $C$ is coanalytic or $\Pi^1_1$ iff its complement is analytic. (All propositions below will remain true if "$\Sigma^1_1$" and "$\Pi^1_1$" are replaced by "Borel".) It is well known that all analytic sets are measurable. ([6], see also [7, 2H.8])

All facts from descriptive set theory that we use here can be found in Moschovakis' book
[7]. For facts and references about forcing consult [5].

4. Acknowledgement: Thanks to Reinhard Winkler for interesting discussions about
random numbers.



The main lemma and its application 

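5. Main Lemma: Assume that for each $y \in \omega{\nearrow}\omega$ we have a set $U_y \subseteq C$ such that

(A) The set $\{(y,t) : y \in \omega{\nearrow}\omega,\ t \in U_y\}$ is a $\Pi^1_1$ subset of $\omega{\nearrow}\omega \times C$.

(B) For all $y \in \omega{\nearrow}\omega$, $\mu(U_y) = 1$.

(C) If $y \le y'$ (that means $\forall n \; y(n) \le y'(n)$), then $U_y \subseteq U_{y'}$.

Then $\mu\big(\bigcap_{y \in \omega{\nearrow}\omega} U_y\big) = 1$.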

We will actually not prove the lemma itself, but an equivalent version: 

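6. Lemma: Assume that for each $x \in {}^\omega\omega$ we have a set $W_x \subseteq C$ such that

(A') The set $\{(x,t) : x \in {}^\omega\omega,\ t \in W_x\}$ is a $\Pi^1_1$ set $\subseteq {}^\omega\omega \times C$.

(B') For all $x \in {}^\omega\omega$, $\mu(W_x) = 1$.

(C') If $x \le x'$, then $W_x \supseteq W_{x'}$.

Then $\mu\big(\bigcap_{x \in {}^\omega\omega} W_x\big) = 1$.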

Proof that lemma 6 implies lemma 5: For every unbounded $x \in {}^\omega\omega$ we define its "inverse" function $x^* \in \omega{\nearrow}\omega$ by

$$x^*(n) = \min\{m : x(m) > n\}.$$

Now given a family $(U_y : y \in \omega{\nearrow}\omega)$ let $W_x := U_{x^*}$ for all unbounded $x \in {}^\omega\omega$, and $W_x = C$ otherwise.

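We claim $\bigcap_y U_y = \bigcap_x W_x$. The inclusion "$\subseteq$" is clear. For the converse inclusion, it is enough to see

$$\forall y \in \omega{\nearrow}\omega \;\; \exists x \in {}^\omega\omega : x^* \le y.$$

Given $y \in \omega{\nearrow}\omega$, we can find a sequence $x$ which increases so fast that for all $n$, $x(y(n)) > n$, e.g., $x(m) := 1 + \max\{k : y(k) \le m\}$ (reading the maximum of the empty set as 0; the maximum exists because $y$ diverges to infinity). So $x^*(n) = \min\{i : x(i) > n\} \le y(n)$.
Hence $\bigcap_x W_x = \bigcap_y U_y$ has measure 1.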
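
The passage from $y$ to a fast-growing $x$ with $x^* \le y$ can be checked on a concrete example. The sketch below is illustrative only: the names `inverse` and `fast_sequence`, the sample $y(k) = \lfloor\sqrt{k}\rfloor$, and the finite search bounds are assumptions made here. It uses $x(m) = 1 + \max\{k : y(k) \le m\}$, where the added 1 is what guarantees the strict inequality $x(y(n)) > n$.

```python
import math

def inverse(x, n, limit=10_000):
    """Return x*(n) = min{m : x(m) > n} for an unbounded x (search capped at `limit`)."""
    for m in range(limit):
        if x(m) > n:
            return m
    raise ValueError("no m below limit with x(m) > n")

def fast_sequence(y, horizon):
    """Return x with x(m) = 1 + max{k < horizon : y(k) <= m}.

    The maximum of the empty set is read as 0. The result agrees with the
    definition over all k only if y(k) > m for every k >= horizon and every
    m that is actually queried (an assumption of this sketch).
    """
    def x(m):
        return 1 + max((k for k in range(horizon) if y(k) <= m), default=0)
    return x

y = math.isqrt                      # sample divergent sequence y(k) = floor(sqrt(k))
x = fast_sequence(y, horizon=1000)  # y(k) <= 7 forces k <= 63, well below the horizon

for n in range(50):
    assert x(y(n)) > n              # x grows fast enough along y
    assert inverse(x, n) <= y(n)    # hence x*(n) <= y(n)
```

The finite `horizon` stands in for the fact that a sequence diverging to infinity takes each value only finitely often; for a non-monotone $y$ the bound has to be chosen with that in mind.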

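7. Application: let $C = {}^\omega 2$ with the usual product measure. Let $s$ range over functions in ${}^\omega\omega$ satisfying $\varphi_s \ge 0$ and $\varphi_s \to \infty$, where $\varphi_s(n) = \lfloor \log n - \log\log n - s(n) \rfloor$. Let $R_s$ be the set of $s$-uniformly distributed sequences in ${}^\omega 2$, and let $R := \bigcap_s R_s$. It is easy to check that $s \le s'$ implies $R_{s'} \subseteq R_s$, and $\varphi_{\varphi_s} = s$.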
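
The identity $\varphi_{\varphi_s} = s$ holds because $s$ is integer-valued, so $\varphi_s(n) = \lfloor a(n) \rfloor - s(n)$ with $a(n) = \log n - \log\log n$, and applying the same operation again returns $s(n)$. A small numerical check, given as a sketch: the function `s_example` is a hypothetical choice made here (roughly half of $a(n)$), and $n$ starts at 2 so that $\log\log n$ is defined.

```python
import math

def phi(s, n):
    """phi_s(n) = floor(log2 n - log2 log2 n - s(n)), defined for n >= 2."""
    return math.floor(math.log2(n) - math.log2(math.log2(n)) - s(n))

def s_example(n):
    # Hypothetical s: about half of log n - log log n, so that
    # phi_s >= 0 and phi_s tends to infinity.
    return math.floor((math.log2(n) - math.log2(math.log2(n))) / 2)

def phi_s(n):
    # The function phi_s associated with s_example.
    return phi(s_example, n)

# Applying phi to phi_s recovers s on the tested range.
recovered = [phi(phi_s, n) for n in range(2, 2000)]
expected = [s_example(n) for n in range(2, 2000)]
```

On this range the two lists coincide, in line with the identity stated above.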

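For each function $y \in \omega{\nearrow}\omega$ let

$$U_y = \begin{cases} R_{\varphi_y} & \text{if } y(n) \le \log n - \log\log n \text{ for all } n, \\ {}^\omega 2 & \text{otherwise.} \end{cases}$$

Now check that all the assumptions of the main lemma are satisfied. If $\varphi_s \to \infty$, then letting $y := \varphi_s$ we have $s = \varphi_y$, so $R_s = R_{\varphi_y} = U_y$. Hence $R = \bigcap_s R_s$ has measure 1.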

8. Remark: Note that the inclusion in lemma 5(C) or lemma 6(C') cannot be replaced by an "almost" inclusion (modulo measure zero sets or even modulo finite sets): Let $F$ be a Borel isomorphism between ${}^\omega\omega$ and $C$ (see [7, 1G4]), and let $W_x = C - \{F(x)\}$, then $\bigcap_x W_x = \emptyset$.

9. Other applications: Almost every metric result in which a $o(\;)$ appears can be strengthened by using lemma 5. For example, Drmota-Winkler showed that

If $s(N) = o(\sqrt{N/\log N})$, then $\mu(T_s) = 1$,

where $T_s$ is the set of $s$-uniformly distributed sequences $x \in [0,1]^\omega$ (see [1] for definitions and references), and $s(N) = o(\sqrt{N/\log N})$ means that $\lim_N s(N) / \sqrt{N/\log N} = 0$.

Now for $y \in \omega{\nearrow}\omega$ let

$$s_y(N) = \frac{\sqrt{N/\log N}}{y(N)}$$

and let $U_y = T_{s_y}$. Applying the lemma we immediately get that

$$\mu\Big(\bigcap_s T_s\Big) = 1,$$

where the intersection is taken over all $s$ satisfying $s(N) = o(\sqrt{N/\log N})$.
Proof of lemma 6 for logicians

The set $W := \{t : \forall x \; t \in W_x\}$ is a $\Pi^1_1$ set, hence measurable. Assume that $\mu(W) < 1$. So there exists a Borel ($G_\delta$) set $A$ with $\mu(A) < 1$ such that $W \subseteq A$. Note that the statement

(*) $\forall t : [\forall x \; t \in W_x \Rightarrow t \in A]$

is a $\Pi^1_2$-statement, hence absolute between any two transitive universes with the same ordinals ([9], see also [7, 8F10]).

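Now let $r$ be a random real over the universe $V$, $r \in C - A$, and work in $V[r]$. Random forcing is ${}^\omega\omega$-bounding, i.e., we have

$$\forall x \in {}^\omega\omega \;\; \exists x' \in {}^\omega\omega \cap V : x \le x'.$$

Since the condition (C') is also absolute, this implies $\forall x \in {}^\omega\omega \; \exists x' \in {}^\omega\omega \cap V : W_x \supseteq W_{x'}$, and so

$$\bigcap_{x \in {}^\omega\omega} W_x = \bigcap_{x' \in {}^\omega\omega \cap V} W_{x'}.$$

Moreover,

$$\forall x' \in {}^\omega\omega \cap V : r \in W_{x'}$$

(since $r$ is a random real and each $W_{x'}$ contains a Borel set of full measure), so together we get

$$\forall x \in {}^\omega\omega : r \in W_x.$$

Since (*) holds also in $V[r]$, we get $V[r] \models r \in A$, a contradiction.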

Proof of lemma 6 for probabilists 

Now we repeat our argument in the language of probability theory. Our main tool is von 
Neumann's selection theorem ([8], see also [7, 4E9]). 
We let our probability space be C. 

We will consider random variables (= measurable functions) $T$ from $C$ into $C$, and random
variables $X$ from $C$ into ${}^\omega\omega$.

10. Definition: Let $B \subseteq C \times {}^\omega\omega$. We say that $f$ "uniformizes" $B$ iff

(1) $f \subseteq B$.

(2) $f$ is a partial function.

(3) $\mathrm{dom}(f) = \{t : \exists x \; (t,x) \in B\}$.

The von Neumann selection theorem says that every $\Sigma^1_1$ set can be uniformized by a measurable function. (Note that the possibly better known $\Pi^1_1$ uniformization theorem will in general not yield measurable functions [7, 5A7].)
Hence we get:

11. Lemma: Let $B \subseteq C \times {}^\omega\omega$ be a $\Sigma^1_1$ set. Assume that $\forall t \in C \; \exists x \in {}^\omega\omega \; (t,x) \in B$. Then

(1) $\exists X \; \mu[(I,X) \in B] = 1$, where $I$ is the identity function from $C$ to $C$, and $X$ ranges over all random variables in ${}^\omega\omega$.

(2) Moreover, $\forall T \; \exists X \; \mu[(T,X) \in B] = 1$. Here, $T$ and $X$ range over random variables in $C$ and ${}^\omega\omega$, respectively.




Proof: (1) There is a measurable function $f \subseteq B$ with domain $C$. So for all $t \in C$, $(t, f(t)) \in B$. Let $X := f$.

(2) Let $f$ be the function from (1), and let $X = f \circ T$.

12. Remark: Those logicians who have not skipped this section will notice that (2) is again Shoenfield's theorem, since random variables naturally correspond to $\mathbb{B}$-names of reals (where $\mathbb{B}$ is the random algebra).

We will also need the following easy fact about random variables $X : C \to {}^\omega\omega$ (which corresponds to the fact that random forcing is ${}^\omega\omega$-bounding):

13. Fact: Let $X : C \to {}^\omega\omega$ be a random variable. Then there is a family $(x_n : n \in \omega)$ of functions in ${}^\omega\omega$ such that

$$\mu[\exists n \; X \le x_n] = 1.$$

Proof: Call a Borel set $A \subseteq C$ of positive measure "good" if there is a function $x_A \in {}^\omega\omega$ such that $A \subseteq [X \le x_A]$. If there are finitely many good sets covering $C$ (up to a measure zero set) then we are done. Otherwise, let $(A_n : n \in \omega)$ be a maximal antichain of good sets, i.e., a maximal family of sets which are

(1) good

(2) pairwise disjoint

Such a sequence can be found using Zorn's lemma. Again, if $\bigcup_n A_n$ covers $C$ up to a measure zero set then we are done. So assume that $A := C - \bigcup_n A_n$ has positive measure $\varepsilon$. Define a sequence $(k_n : n \in \omega)$ of natural numbers such that for each $n$ the set $[X(n) > k_n]$ has measure $< \varepsilon/2^{n+3}$. Thus, the set $[\forall n \; X(n) \le k_n] \cap A$ has measure $> \varepsilon - \varepsilon/4 > 0$, contradicting the maximality of the family $(A_n : n \in \omega)$.

14. Second proof of lemma 6: Let $I : C \to C$ be the identity function. Clearly, for any $x \in {}^\omega\omega$ we have $\mu[I \in W_x] = \mu(W_x) = 1$.

We now claim that the same is still true if we replace $x$ by a random variable $X$. Indeed, let $X$ be a random variable, $X : C \to {}^\omega\omega$. Then we can find a family of functions $(x_n : n \in \omega)$ such that

$$\mu[\exists n \; X \le x_n] = 1.$$

Using the anti-monotonicity of the family $(W_x : x \in {}^\omega\omega)$ we get

$$\mu[\exists n \; W_X \supseteq W_{x_n}] = 1$$

and so: $\mu[W_X \supseteq \bigcap_n W_{x_n}] = 1$. Hence

(*) $\mu[I \in W_X] \ge \mu[I \in \bigcap_n W_{x_n}] = \mu(\bigcap_n W_{x_n}) = 1$

for all random variables $X$. So for all $X$, $\mu[I \notin W_X] = 0$, hence by the contrapositive of lemma 11 it is impossible that $\forall t \; \exists x \; t \notin W_x$. Therefore

$$\exists t \; \forall x \; t \in W_x.$$
So the set $\bigcap_x W_x$ is nonempty. Relativizing the above argument to any positive Borel set $A$ we can show that the set $A \cap \bigcap_x W_x$ is nonempty. Hence $W$ has outer measure 1. As remarked above, $W$ must be measurable, so indeed $\mu(W) = 1$.